NSF PAR Search | NSF Public Access Repository

Plasmons in the Kagome metal CsV3Sb5

https://doi.org/10.1038/s41467-024-49723-x

Shiravi, H; Gupta, A; Ortiz, B R; Cui, S; Yu, B; Uykur, E; Tsirlin, A A; Wilson, S D; Sun, Z; Ni, G X (December 2024, Nature Communications)

Plasmon polaritons, or plasmons, are coupled oscillations of electrons and electromagnetic fields that can confine the latter into deeply subwavelength scales, enabling novel polaritonic devices. While plasmons have been extensively studied in normal metals or semimetals, they remain largely unexplored in correlated materials. In this paper, we report infrared (IR) nano-imaging of thin flakes of CsV₃Sb₅, a prototypical layered Kagome metal. We observe propagating plasmon waves in real-space with wavelengths tunable by the flake thickness. From their frequency-momentum dispersion, we infer the out-of-plane dielectric function $\epsilon_c$ that is generally difficult to obtain in conventional far-field optics, and elucidate signatures of electronic correlations when compared to density functional theory (DFT). We propose correlation effects might have switched the real part of $\epsilon_c$ from negative to positive values over a wide range of middle-IR frequencies, transforming the surface plasmons into hyperbolic bulk plasmons, and have dramatically suppressed their dissipation.
Full Text Available
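
As a reading aid for the CsV₃Sb₅ abstract above, the standard textbook dispersion for extraordinary (TM) waves in a uniaxial medium shows why a sign change in $\epsilon_c$ matters. This is a sketch and not the authors' derivation: the symbols $\epsilon_{ab}$ (in-plane dielectric function), $k_{ab}$, $k_c$ (in-plane and out-of-plane wavevector components) and $k_0$ (free-space wavenumber) are introduced here for illustration, and it is assumed that the real part of $\epsilon_{ab}$ stays negative, as expected for a metallic in-plane response.

```latex
% Extraordinary-wave dispersion in a uniaxial medium (optic axis along c):
\frac{k_{ab}^{2}}{\epsilon_{c}} + \frac{k_{c}^{2}}{\epsilon_{ab}} = k_{0}^{2}

% Hyperbolic regime: the two principal components have opposite signs.
\operatorname{Re}\epsilon_{ab} \cdot \operatorname{Re}\epsilon_{c} < 0

% With Re(eps_ab) < 0 assumed, Re(eps_c) > 0 satisfies this condition, so the
% isofrequency surface is a hyperboloid rather than a closed ellipsoid and
% large-momentum modes can propagate inside the bulk.
```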
540 Covid-19-related alterations in racial disparities in dermatology practice patterns

https://doi.org/10.1016/j.jid.2024.06.556

Cui, S; Zhang, L; Xie, Y; Pentland, B; Pentland, AP; Wolf, J Ryan (August 2024, Journal of Investigative Dermatology)

Full Text Available
FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms.

Qiu, H; Mao, W; Patke, A; Cui, S; Wang, C; Franke, H; Kalbarczyk, Z; Başar, T; Iyer, R (September 2024, MLSys)
Gibbons, Phillip B; Pekhimenko, Gennady; De Sa, Christopher (Ed.)
The emergence of ML in various cloud system management tasks (e.g., workload autoscaling and job scheduling) has become a core driver of ML-centric cloud platforms. However, there are still numerous algorithmic and systems challenges that prevent ML-centric cloud platforms from being production-ready. In this paper, we focus on the challenges of model performance variability and costly model retraining, introduced by dynamic workload patterns and heterogeneous applications and infrastructures in cloud environments. To address these challenges, we present FLASH, an extensible framework for fast model adaptation in ML-based system management tasks. We show how FLASH leverages existing ML agents and their training data to learn to generalize across applications/environments with meta-learning. FLASH can be easily integrated with an existing ML-based system management agent with a unified API. We demonstrate the use of FLASH by implementing three existing ML agents that manage (1) resource configurations, (2) autoscaling, and (3) server power. Our experiments show that FLASH enables fast adaptation to new, previously unseen applications/environments (e.g., 5.5× faster than transfer learning in the autoscaling task), indicating significant potential for adopting ML-centric cloud platforms in production.
Full Text Available
Power-aware Deep Learning Model Serving with µ-Serve. In Proceedings of the 2024 USENIX Annual Technical Conference (ATC 2024).

Qiu, H; Mao, W; Patke, A; Cui, S; Jha, S; Wang, C; Franke, H; Kalbarczyk, Z; Başar, T; Iyer, R (September 2024, USENIX ATC '24)
Begnum, Kyrre; Border, Charles (Ed.)
With the increasing popularity of large deep learning model-serving workloads, there is a pressing need to reduce the energy consumption of a model-serving cluster while still satisfying throughput or model-serving latency requirements. Model multiplexing approaches such as model parallelism, model placement, replication, and batching aim to optimize the model-serving performance. However, they fall short of leveraging the GPU frequency scaling opportunity for power saving. In this paper, we demonstrate (1) the benefits of GPU frequency scaling in power saving for model serving; and (2) the necessity for co-design and optimization of fine-grained model multiplexing and GPU frequency scaling. We explore the co-design space and present a novel power-aware model-serving system, μ-Serve. μ-Serve is a model-serving framework that optimizes the power consumption and model-serving latency/throughput of serving multiple ML models efficiently in a homogeneous GPU cluster. Evaluation results on production workloads show that μ-Serve achieves 1.2–2.6× power saving by dynamic GPU frequency scaling (up to 61% reduction) without SLO attainment violations.
Full Text Available
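
The two figures quoted in the μ-Serve abstract above ("1.2–2.6× power saving" and "up to 61% reduction") can be related with a line of arithmetic. The sketch below assumes that an N× saving means baseline power divided by optimized power equals N; the abstract does not state this definition, and the function name `saving_factor_to_reduction` is hypothetical.

```python
def saving_factor_to_reduction(factor: float) -> float:
    """Convert an N-times power saving into a fractional reduction, 1 - 1/N.

    Assumption (not stated in the abstract): N = baseline power / new power.
    """
    if factor <= 0:
        raise ValueError("saving factor must be positive")
    return 1.0 - 1.0 / factor


if __name__ == "__main__":
    # Endpoints of the range quoted in the abstract.
    for factor in (1.2, 2.6):
        reduction = saving_factor_to_reduction(factor)
        print(f"{factor}x saving -> {reduction:.1%} reduction")
```

Under that assumption, the upper end of the range (2.6×) corresponds to a reduction of about 61.5%, which is consistent with the "up to 61%" figure in the abstract.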
Covid-19-related alterations in racial disparities in dermatology practice patterns

Cui, S; Zhang, L; Xie, Y; Pentland, BT; Pentland, AP; Ryan_Wolf, J (May 2024, Journal of Investigative Dermatology)

Full Text Available
Eco-friendly passive radiative cooling using recycled packaging plastics

https://doi.org/10.1016/j.mtsust.2023.100448

Liu, Y; Liu, X; Chen, F; Tian, Y; Caratenuto, A; Mu, Y; Cui, S; Minus, ML; Zheng, Y (September 2023, Materials Today Sustainability)

Full Text Available
Multifunctional superelastic graphene aerogels derived from ambient-dried graphene oxide/camphene emulsions

Tian, S.; Zhou, L.; Wu, S.; Jian, R.; Keewan, A.; Cui, S.; Xiong, G. (January 2022, Materials Letters)

Full Text Available
ML-driven Malware that Targets AV Safety

Jha, S.; Cui, S.; Banerjee, S.; Tsai, T.; Kalbarczyk, Z.; Iyer, R. (June 2020, International Conference on Dependable Systems and Networks)

Full Text Available
